North Tyneside
'A lot of this is speculative': faith and fear mix amid 3tn global datacentre boom
Several new sites such as this are in the pipeline in the UK. Several new sites such as this are in the pipeline in the UK. 'A lot of this is speculative': faith and fear mix amid $3tn global datacentre boom The global investment spree in artificial intelligence is producing some remarkable numbers and a projected $3tn (£2.3tn) spend on datacentres is one of them. These vast warehouses are the central nervous system of AI tools such as OpenAI's ChatGPT and Google's Veo 3, underpinning the training and operation of a technology into which investors have poured vast sums of money. Despite concerns that the AI boom could be a bubble waiting to burst, there are few signs of it at the moment.
Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization
Qi, Siya, Cao, Rui, He, Yulan, Yuan, Zheng
With the rapid development of large language models (LLMs), LLM-as-a-judge has emerged as a widely adopted approach for text quality evaluation, including hallucination evaluation. While previous studies have focused exclusively on single-context evaluation (e.g., discourse faithfulness or world factuality), real-world hallucinations typically involve mixed contexts, which remains inadequately evaluated. In this study, we use summarization as a representative task to comprehensively evaluate LLMs' capability in detecting mixed-context hallucinations, specifically distinguishing between factual and non-factual hallucinations. Through extensive experiments across direct generation and retrieval-based models of varying scales, our main observations are: (1) LLMs' intrinsic knowledge introduces inherent biases in hallucination evaluation; (2) These biases particularly impact the detection of factual hallucinations, yielding a significant performance bottleneck; (3) The fundamental challenge lies in effective knowledge utilization, balancing between LLMs' intrinsic knowledge and external context for accurate mixed-context hallucination evaluation.
NCL-SM: A Fully Annotated Dataset of Images from Human Skeletal Muscle Biopsies
Khan, Atif, Lawless, Conor, Vincent, Amy, Warren, Charlotte, Di Leo, Valeria, Gomes, Tiago, McGough, A. Stephen
Single cell analysis of human skeletal muscle (SM) tissue cross-sections is a fundamental tool for understanding many neuromuscular disorders. For this analysis to be reliable and reproducible, identification of individual fibres within microscopy images (segmentation) of SM tissue should be automatic and precise. Biomedical scientists in this field currently rely on custom tools and general machine learning (ML) models, both followed by labour intensive and subjective manual interventions to fine-tune segmentation. We believe that fully automated, precise, reproducible segmentation is possible by training ML models. However, in this important biomedical domain, there are currently no good quality, publicly available annotated imaging datasets available for ML model training. In this paper we release NCL-SM: a high quality bioimaging dataset of 46 human SM tissue cross-sections from both healthy control subjects and from patients with genetically diagnosed muscle pathology. These images include $>$ 50k manually segmented muscle fibres (myofibres). In addition we also curated high quality myofibre segmentations, annotating reasons for rejecting low quality myofibres and low quality regions in SM tissue images, making these annotations completely ready for downstream analysis. This, we believe, will pave the way for development of a fully automatic pipeline that identifies individual myofibres within images of tissue sections and, in particular, also classifies individual myofibres that are fit for further analysis.
Introducing NCL-SM: A Fully Annotated Dataset of Images from Human Skeletal Muscle Biopsies
Khan, Atif, Lawless, Conor, Vincent, Amy, Warren, Charlotte, Di Leo, Valeria, Gomes, Tiago, McGough, A. Stephen
Single cell analysis of skeletal muscle (SM) tissue is a fundamental tool for understanding many neuromuscular disorders. For this analysis to be reliable and reproducible, identification of individual fibres within microscopy images (segmentation) of SM tissue should be precise. There is currently no tool or pipeline that makes automatic and precise segmentation and curation of images of SM tissue cross-sections possible. Biomedical scientists in this field rely on custom tools and general machine learning (ML) models, both followed by labour intensive and subjective manual interventions to get the segmentation right. We believe that automated, precise, reproducible segmentation is possible by training ML models. However, there are currently no good quality, publicly available annotated imaging datasets available for ML model training. In this paper we release NCL-SM: a high quality bioimaging dataset of 46 human tissue sections from healthy control subjects and from patients with genetically diagnosed muscle pathology. These images include $>$ 50k manually segmented muscle fibres (myofibres). In addition we also curated high quality myofibres and annotated reasons for rejecting low quality myofibres and regions in SM tissue images, making this data completely ready for downstream analysis. This, we believe, will pave the way for development of a fully automatic pipeline that identifies individual myofibres within images of tissue sections and, in particular, also classifies individual myofibres that are fit for further analysis.
One in three councils using algorithms to make welfare decisions
One in three councils are using computer algorithms to help make decisions about benefit claims and other welfare issues, despite evidence emerging that some of the systems are unreliable. Companies including the US credit-rating businesses Experian and TransUnion, as well as the outsourcing specialist Capita and Palantir, a data-mining firm co-founded by the Trump-supporting billionaire Peter Thiel, are selling machine-learning packages to local authorities that are under pressure to save money. A Guardian investigation has established that 140 councils out of 408 have now invested in the software contracts, which can run into millions of pounds, more than double the previous estimates. The systems are being deployed to provide automated guidance on benefit claims, prevent child abuse and allocate school places. But concerns have been raised about privacy and data security, the ability of council officials to understand how some of the systems work, and the difficulty for citizens in challenging automated decisions.
Robots deliver award winning customer service in North Tyneside
In fact, the customer experience is now so good that the council and its service delivery partner, ENGIE, won'Best Application of Technology' at the UK Customer Satisfaction Awards 2016. The judges concluded that "work to develop the council's digital presence has been enormously successful, resulting in a vast improvement in customer service". North Tyneside and ENGIE worked with eforms specialist, IEG4, to create the new benefit claim process. Robotic process automation (RPA) enables one piece of software to talk to another piece of software whilst continuing to use the human user interface. In this case a software robot has been created and trained to do the repetitive work and processing involved in processing a housing benefit claim.